AIbase

# Low-latency audio processing

Voila Chat
MIT
Voila is a new series of large-scale speech-language foundation models designed for human-computer voice interaction.
Tags: Text-to-Audio, Transformers, Supports Multiple Languages
maitrix-org
2,423
32
Sanji
Sanji is a Retrieval-based Voice Conversion (RVC) model designed for audio-to-audio conversion tasks.
Tags: Speech Synthesis, Transformers
sail-rvc
208
0
Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V1
Apache-2.0
This model is an automatic speech recognition model fine-tuned from wav2vec2-large-xlsr-53 on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset.
Tags: Speech Recognition, Transformers
gary109
48
0
© 2025 AIbase